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ITERATION  SCHEMES  FOR  SOLVII^G  RECTANGULAR  GAMES 
Harold  N.  Shapiro 

1.  Introduction.   In  connection  with  linear  programming 
problems,  or  in  the  consideration  of  zero-sum  two  person  games 
in  normal  form  the  problem  arises  of  computing  the  value  of  and 
optimal  strategies  for  a  rectangular  game  given  by  an  m  y.  n 
matrix  A  =  (a, .).   Various  Iteration  schemes  have  been  proposed 
for  achieving  this  and  several  have  been  proved  to  "converge" 
(i.e.  accomplish  the  desired  purpose).   These  schemes  vary  in  the 
degree  of  their  complexity,  and  with  regard  to  what  assertions 
can  be  made  a  priori  concerning  the  "rate  of  convergence".   For 
example,  Von  Neumann  [1]  has  proposed  an  iteration  scheme  which 
though  cumbersome  for  hand  computations  is  easily  coded  for 
machine  work,  and  for  which  he  proves  a  very  good  a  priori  esti- 
mate of  the  rate  of  convergence.   On  the  other  hand,  the  scheme 
of  George  Brown  and  Julia  Robinson  [2]  is  easily  carried  out 
without  a  computing  machine,  and  has  also  been  proved  to  converge 
In  this  note  it  is  proposed  to  consider  this  latter  scheme  and 

by  suitably  modifying  and  elaborating  upon  the  method  of  [2] 
provide  for  it  an  a  priori  estimate  of  the  rate  of  convergence. 

2.  Description  of  the  iteration  scheme,  and  preliminary 
lemmas .   The  iteration  scheme  of  George  Brown  and  Julia  Robinson 
is  based  upon  the  following  procedure  of  fictitious  play.   Given 
the  rxx  game  matrix  A  =  (a.  .),  we  start  with  two  initial  vectors 
U(0)  =  {U^(0),  . ..,U^(0)),  and  V(0)  =  ( V^( 0) ,  .  .  .  , V^ ( 0) ) .   U^(0) 

is  player  I's  Initial  relative  estimate  of  what  he  expects  to  win 


should  he  play  the  j-th  column.   Player  I  then  chooses  the  rovj 
i(0)  which  is  such  that  U, ,qv  =  max  U(0),  (max  U(0)  denotes  the 
maximum  component  of  TJ(0)),  and  player  TI  chooses  the  j(C) 
column  wherein  V./„s  =  min  V(0).   That  is,  player  I  picks  the 
row  from  which  he  expects  to  get  the  most  and  player  II  picks 
the  column  which  he  expects  will  give  player  I  the  least.   The 
tvjo  players  then  revise  their  estimates  of  the  relative  values 
of  the  rows  and  columns  to: 

V(l)  =  V(0)  +  R.(Q)   , 
TJ(1)  =  U(0)  +  C.^Q^       , 

(where  R.  denotes  the  i-th  row  of  A  and  C.  denotes  the  j-th 
column  of  A).   Then  for  the  second  play  players  I  and  II  pick 
the  i(l)  row  and  j(l)  column  respectively,  where  U.,-,  (l)=max  U(l) 
and  V./,^(l)  =  min  V(l).   As  the  play  proceed  in  this  way  we  have 

(2.1)  U(n+1)  =  U(n)  +  C^^^^   , 

(2.2)  V(n+1)  =  V(n)  +  R^(^^   , 


where 
and 


Y^.(^j(n)  =  min  V(n)   , 

^i(n)  ^'^^  "^  ^^-^  ^^^^ 
We  then  obtain  that 


s 
(2.3)  U(n)  =  U(0)  +  ^  ^i,(n)C,   , 

k=l  ^    ^ 


(2.^)  V(n)  =  V(0)  +  ^ '^i,(n)R.   , 

k=l   ^    " 

where  ''^^.(n)  equals  the  number  of  solutions  of  j(^)  =  k, 

0  <  j2.  <  n-1;  and  Tj^ln)  equals  the  number  of  solutions  of  i(i')  =  k, 

s  r 

0  <  i  <   n-1.   Clearly  >   <r,  (n)  =  n  and  >   x.Arx)    =  n,  so  that 

~   ~  k=l   ^  k=l  ^ 

dividing  (2.3)  and  {Z.\\)    by  n  we  obtain 

fp  <\  U(n)  _  TJ(0)  .  ^  ^k^^^  C 

and 

r?  A^  V(n)  _  V(0)  .  ^  "^'k^"^  J, 

where  .-^(n)  =  (o-^(n)/ny  .  .  . ,  crg(n)/n)  and  X(n)  =  ('c.|^(n)/n,  .  .  . , 
t  (n)/n)  are  strategy  vectors. 

i-'e  next  prove  several  lemmas  which  are  needed  for  later 
arguments . 

Lemma  2.1.   If  v  is  the  value  of  the  game  matrix  A  then 


1^   n\               max  U(n)  -  min  U(0)  ^  „  ^  min  V(n)  -  max  V(Q) 
,2.7) _ >v>  -  . 

Proof:   Letting  yi.(n)  =  o",  (n)/n  we  have  from  (2.5) 

"^^^  ^^^^  >  "^^n  IT(0)  ^  ^^^  (^  y^(n)C-  )    , 
n    —    n  k=T 

and  since 

3  r_ 

max  (^^ y,  (n)C,  )  =  max  (^2-  ^iiryir^^))  -  ^   ' 

fel   ^    ^     1   k=l   ^^  ^ 

the  left  inequality  in  (2.7)  follows.   The  right  inequality  may 
be  deduced  similarly  from  (2.6). 


For   convenience   we    introduce   the    following  notations: 

/\jj  Y^'^^  ~  ""^^-^  U(n)  -  min  V(n)  , 

A,y  TT^'^i)  ~  ^^-^  V(n)  -  min   U(n)  , 

/\rj  TT(n)  =  max  U(n)  -  min  U(n)  , 

Ay  Y(n)  =  max  V(n)  -  min  V(n)  , 

and   note    the    trivial    identity 

(2.8)  Au,v(^^    ^   Av.u^^)    =   Au,U^^)    -"   Av,v(^)       • 
We   next    introduce    the   vectors 

U(n)    =  U(n)    -    (max  U(0))1^      , 
V(n)    =  V(n)    -    (min  V(0))1 

where  1   and  1   are  vectors  with  all  components  1,  and  of 
r      s 

dimensions  r  and  s  rexpectively ,   V'e  note  also  that 

(2.9)  max  U(0)  =  min  V(0)  =  0   , 
and 

(2.10)  Au  v^^^  -  A.  ^(n)  =  Au  v^O)   • 

U,V  y^^  u,v 

Lemma  2.2.   If  max  IT(0)  =  min  V(0)  =  0,  then  Ay  u^"^^  -  ^* 

Proof:   Letting  yK.(n)  =  (X  (n)/n,  x,  (n)  =  t,  (n)/nj  we  have 
from  (2.5),  (2.6), 


min  U(n)  ^  max  U(0)  ^   .   /"t^   /     \n    ^    ^ 


3_ 

k~I 


and 

r 


max  V(n)  ^  min  V(0)  ^      ,  xr /  \d  \  ^ 

>  ~ — -   +  max  (>   X,  (n)R,  )  >  v   , 

^  k=l   ^    ^ 


n 


and  the  lemma  follows. 

From  (2.8)  we  see  that  under  the  assumption  of  the  above 
lemma  we  have 


(2.11) 


Ait  v^"^  <  Atj.tj(^)  -^  Av.v^")   ' 


^II,V 


whereas  from  Lemma  2.1  we  have  (in  all  cases) 


(2.12) 


An  v(="^^  ^  -Av  ri(0)   . 


iU,V 


v,u^ 


Also,  under  the  assumption  of  Lemma  2.2  we  see  that  for 
a  =  max  la 


(2.13) 


■ij" 


An  v^'^)   1  2an      .    l/' 


^U,V 


In   carrying   out   the   details    of   our   estimate    of   the    rate    of 
convergence    a   simple   numerical   inequality   will  be   needed,    which 

we   now   provide   as 

T  ->    ->         \  f-      .      -,.   I         J        ^   o(  r+s-1 )  ( r+s-2)       ,, 

Lemma  2.3.      if  m+n  >  1^   and  n  >  2'  ''  ,    then 


(r^-s-3)' 
pr+s-3        p   r+s-2 


n 


r-?-3-3 
r+s-2 


>  1 


Proof: 


1    >  -L  +    1     >         1  .  1 

^         nr       IL  -     r+s-3        rrr+s}(r+s-3) 


so  that 


(r+s-3) 
1  <  (1  -  2'  ^+3-2  )  j^^"*"^^^^"^^"^^ 

(r+s-3)^ 


(r+s-3)^    r+s-3 


3.   Estimation  of  the  rate  of  convergence.   The  estimation 
of  the  rate  of  convergence  focuses  on  ascertaining  how  fast 
/V,  ^(n)/n  goes  to  0  as  n  goes  to  infinity.   Since  from  (2.12) 
we  have 

(3.1)  Au,v^'"^)/^  -  -Av,u(°^/'"  ' 

we  need  only  focus  on  obtaining  an  upper  estimate  for  this  ratio. 
For  this  we  have 

Theorem  3.1.   If  max  U(0)  =  min  V(0)  =  0  we  have 

(3.2)  An  v(^)/^  <  a  2^  ^ 


and  in  general 


■U,V' 

r+s-2 
n 


a  2^^^ 


(3.3)  Au  v^"^^/^  -  ^ifrz  ^  ^u  v(°^/^     ' 

^r-^s-2 

where  a  =  max  [a. .|. 
i,  J     *' 

Proof:   Prom  (2.9)  and  (2.10)  we  note  that  (3-2)  implies 
(3-3)  so  that  we  need  only  derive  {3.2).      This  will  be  achieved 
by  induction  on  r+s . 

For  r+s  =  2,  the  matrix  A  =  (a)  is  1x1,  so  that  if 
U(0)  =  V(0)  =  0,  U(n)/n  =  V(n)/n  =  a.   Thus  in  this  case 


/\„  „(n)/n  =  0,  and  (3.2)  certainly  true,   ve  now  assume  that  If 
max  U(0)  =  min  V(C)  =  0,  (3.2)  holds  for  all  matrices  for  which 
r+3  <  k,  k  >  3;  and  consider  the  case  r+s  =  k. 

Next  we  shall  prove  that  given  integers  n  and  T,  0  <  T  <  n, 
then  ei  ther 

or        C\^  -1    1 


<v 


1 


,r+s-l  m   r+s-3 


(ii)   Au,V^"^  -  Au,V^^-^^  <  a2^^^-^  T 

For  suppose  (i)  were  false;  so  that  A^  Y(n)  >  l+aT .   Then,  from 
(2.11)  we  obtain  that  either  /\y  y(n)  >  2aT  or  /\_^   ^(n)  >  2aT. 
'■'e  will  proceed  on  the  assumption  that  /\-^   u^"^-)  ^  2aT,  (an 
entirely  analogous  argument  covers  the  other  alternative),  so 
that 

(3.14)  max  U(n)  -  min  U(n)  >  2aT   . 

Suppose  that  max  U(n)  occurs  in  the  i-th  component  and  min  U(n) 
occurs  in  the  j-th  component.   Then  since  a  component  cannot 
change  by  more  than  a  in  one  step  of  the  process,  (3.1|)  implies 

(3.5)  U.(n-w)  -  U.(n-w)  >  0   , 

for  all  integers  w,  0  <  w  <  T.   Thus  we  see  that  the  j-th  compo- 
nent never  occurs  as  a  maximum  in  the  T  steps  preceding  the  n-th 
step.   Let  A^'^'^  be  the  matrix  obtained  from  A  by  deleting  the 
j-th  row,  and  let  U  "^'(m)  be  the  vector  obtained  from  U(m)  by 
deleting  the  j-th  component.   If  we  then  apply  the  iteration 
scheme  to  A^'^',  taking  as  initial  vectors  U  "^  (n-T)  and  V(n-T), 


8 


we  clearly  have  that  the  first  T+1  vectors  of  the  respective 
U  -^ '    arid  V  series  are 

U^  J^n-w)   ,     w  =  T,T-1,  ...,0   ; 

V(n-w)   ,     w  =  T,T-1, . . .,0   . 
Setting 

''•(j)(n-w)  =  U^J^n-w)  -  [max  U^  J^n-T)]l^_-L   , 


(3.6) 


V(n-w)  =  V(n-x,j)  -  [min  V(n-T)]lg   , 


we  may  apply  the  inductive  hypothesis  to  obtain 

1-   1 
(3.7)      raaxU^-'^n)  -  min  V(n)  <  a  2^"^^"^  T    ^^^'^      . 

Prom  (3.6)  and  the  manner  in  which  j  was  chosen,  it  follows  that 
the  left  side  of  (.3.7)  equals  /\_y  ^(n)  -  /\y  yd^-T)  so  that  we 
obtain  ( ii ) • 

Tf  n  <  2'^^'^"^^^^'^^'^^  utillzinfT  (2.13)  we  have 

1  1-       1 


A/    ,         o        ^   o      r+s-2  r+s-2 

,-.  „(n)    <  2an  <  2an  n 


^U,V 


1 L^      .    1-   1 


^  o   oT+s-l    "  r+s-2  _  „or+s  „    r+s-2 
<  2a  <:^      n         =  a2    n 


a2 


Thus  in  this  case  /\^   y(n)/n  <  -x/J+iTI'  ^"^  ^^®  induction  is 
completed.   I''e  therefore  need  only  consider  the  case  wherein 

^  „(r+s-l)(r+3-2) 
n   -^   c.  • 

For  a  fixed  positive  integer  T,  we  call  n  an  integer  of  the 
first  kind  if  (i)  is  satisfied,  otherwise  we  call  n  an  integer 


of  the  second  kind.   In  view  of  (2.13),  if  n  <  T,  then  n  is  of 
the  first  kind. 

Let  q  =  [n/T].   Then  among  the  integers  n,n-T,n-2T, . . . , n-qT, 
there  are  integers  of  the  first  kind  (since  n-qT  is  of  the  first 
kind).   Let  n-rT  be  the  largest  integer  of  this  set  which  is  of 
the  first  kind.   Then 

1-  1 


<  (a  2^"-"^-^  T   ^■^^-^)r      i  1+aT 


<  na   2^''^-^   T   """^"^    +   1+aT      ,  (since  x    ±  n/T)       . 

Thus    for   all  n  >  T   >   0  we   have 


(3.8)  Au,v(^)/^  ^  a(2^-^^-l  T   '^''^   +   i|T/n)       . 

Consider  first  the  case  wherein  r+s  =  3.   The  result  has 

been  established  for  n  <  2^  ^""^-^  ^  ^ ''^^"^  ^  =  k-      For  n   >  k,    and 
T  =  2,  (3.8)  becomes 

Au,V^^^/"  <  8a/n  , 

which  is  precisely  (3-2)  in  this  case. 

We  next  consider  r+s  >  I4-.   From  Lemma  2.3  we  obtain  the 
existence  of  an  integer  T  --   Tin)    such  that 

( r+3-3)^  r+s -3  r+s-3 

r+s-2     r+s-2    m/  \    or+s-3   r-i-s-2 
2         n       <  T(n)  <  2    -^  n 


10 


Recalling  that  we  are  carrying  the  assumption  n  >  p^  ^"^"^"^^  ^  ^•'^^-2^, 
it  is  easily  verified  that  T(n)  satisfies 

(aj   T(n)  <  n   , 


(b)   2^-^^-^  T(n)  ^-^^-^  <  [^T(n)/n   , 


and 


c)   8T(n)/R  <  2^''^   n 


r+s    r+s-2 


Choosing  T  =  T(n)  in  (3.8),  by  means  of  (ai,  (b),  and  (c)  above, 
(3.2)  follows,  and  the  induction  is  completed. 

In  conclusion  we  note  that  Theorem  3.1  together  with 
Lemma  2.1  implies  that 


V  - 


max  U(n: 


n 


r+s -2 
n 


V  - 


min  V(n) 
n 


r+s-2 


n 


14-.   The  case  of  symmetric  games.   In  the  case  of  symmetric 

sanies,  for  which  the  game  matrix  is  skew  symmetric  the  above 

scheme  is  considerably  simplified  by  the  fact  that  we  may  take 

U(0)  =  -V(0)  and  need  only  write  down  one  of  the  two  sequences 

U(n),  V(n).   Thus  for  example,  we  need  only  consider  the 

recurrence 

V(n+1)  =  V(n)  +  R./  ^   , 
/  i(n)   ' 


whe  re 


^i(n)  ^^^    ~   ^^^^   ^^^' 


The  result  of  the  previous  section  then  asserts  that  min  V(n)/n 
tends  to  0  like  c/n  '       ~~ ,    for  an  m  by  m  skew-symmetric  matrix. 


11 


However,  the  method  of  proof  given  above  provides  a  more  rapid 
rate  of  convergence  in  this  case  —  namely  like  c/n  '    •   This 
stems  from  the  fact  that  in  the  argument  which  led  to  the 
alternative  (ii)  in  Section  3,  when  we  delete  a  row  (or  column) 
WG  may  (because  of  the  symmetry)  delete  the  same  column  (or  ro:>?) 
also  . 

'•'hether  or-   not  the  estimates  of  rate  of  convergence 
obtained  above  are  best  possible  for  this  iteration  scheme  is 
not  known.   In  all  probability  they  are  not.   In  fact,  there  is 
some  evidence  that  the  convergence  is  faster  if  some  of  the 
rows  or  colurans  are  innessential .   This  suggests  the  possibility 
that  the  above  method  could  be  elaborated  upon  so  as  to  provide 
an  estimate  of  the  form  (3-3)  in  which  r  and  s  would  be  the 
niireber  of  rows  and  columns  respectively  of  the  essential  part 
of  the  game  matrix. 
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